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CROSS REFERENCE TO ATTACHED APPENDIX 

Appendix A contains the following files in one CD- 
ROM (of which two identical copies are attached 
10 hereto) , and is a part of the present disclosure and is 
incorporated by reference herein in its entirety: 

Volume in drive F is VOL2 

Volume Serial Number is 0A0A-FA32 



15 



30 



Directory of F:\ 



01/17/01 12:01p <DIR> 

01/17/01 12:01p <DIR> 

20 01/09/01 10:49a 228,371 ms_svr . c 

01/05/01 03: lip 6,959 ms_svr . h 

12/01/00 03:10p 7,783 mshdr . h 

12/01/00 03:10p 1,611 mstypes.h 

12/01/00 03:10p 5,761 rtphdr . h 

25 7 File(s) 250,485 bytes 

Total Files Listed: 

7 File(s) 250,485 bytes 



12,514,230,272 bytes free 

The files of Appendix A form source code in the 
language C for implementing a media stream server, call 
controller and media stream control, for use with an 
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illustrative implementation of this invention as 
described more completely below. 

A portion of the disclosure of this patent 
document contains material which is subject to 
copyright protection. The copyright owner has no 
objection to the facsimile reproduction by anyone of 
the patent document or the patent disclosure, as it 
appears in the U.S. Patent and Trademark Office patent 
files or records, but otherwise reserves all copyright 
rights whatsoever . 

BACKGROUND OF THE INVENTION 

A PBX (private branch exchange) provides 
interconnections among internal telephone lines that 
are connected to telephone instruments at a single 
facility (such as a law office) . The PBX also 
interconnects the internal telephone lines to a smaller 
number of external telephone lines (also called 
"trunks" ) of a telephone company. Such PBXs provide a 
number of features of the type described in # for 
example, "DEFINITY Communications System, Generic 1 and 
Generic 3 and System 75, 8410 Voice Terminal User's 
Guide," pages 5-8, 1994, published by AT&T GBCS 
Documentation Development, Middletown, New Jersey, 
07748-1998 . 

There is a growing trend towards audio 
communications taking place over packet -switched 
networks, such as the Internet, instead of directly on 
the telephone network (that provides only circuit 
switching and is sometimes called "public switched 
telephone network (PSTN)"). Such audio communications 
can be facilitated by various types of devices such as: 
(1) specialized PBXs (also called "packet-switching 
PBXs" ) that directly connect to packet-switched 
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networks, (2) gateways that connect circuit -switching 
devices to packet -switched networks and (3) software 
tools that connect personal computers to packet - 
switched networks. One example of a packet -switching 
5 PBX, as seen in FIG. 1, is described in "Intranet and 
IP-Based UnPBXs," Chapter 7, pages 7-16 to 7-22, in the 
book entitled "the UnPBX" edited by Edwin Margulies, 
Flatiron Publishing, Inc. 1997. 

In this example, the packet -switching PBX includes 
10 one or more telephony switches 1 and 2, each of which 
has twelve ports that can be connected either to 
13 internal telephone lines or to external telephone 

lines. In addition, each of telephony switches 1 and 2 
^2 includes a digital port that is connected to an 

O 15 Ethernet 3 for communication therebetween. For 
j"^ example, if telephone instrument 4 needs to be 

, connected to telephone instrument 5, switch 1 routes 

!-f the call via Ethernet 3 to switch 2. Information 

carried by any call routed over Ethernet 3 is divided 
-•j=? 20 into a number of portions; each portion placed in a 

Id 

{«£. packet (such as a Universal Datagram Protocol (UDP) 

packet conforming to the Transmission Control Protocol 
(TCP) Internet Protocol (IP) used over the Internet) 
that is transmitted between switches 1 and 2. TCP and 

25 UDP both travel on top of the IP. TCP is a session- 

and connection-oriented protocol. UDP carries packets 
with nothing in the UDP portion itself to identify any 
stream, or connection. 

An example of a conventional gateway for circuit - 

3 0 switching PBXs is an Internet Telephony Server which 

includes a PSTN interface board 11, as seen in FIG. 2, 
for connection to telephone lines (Tl/El/analog) of a 
PBX, and a Digital Signal Processor (DSP) card 12 that 
performs voice compression and/or fax processing and 
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generates packets, and the packets are sent to an 
Ethernet 13 via an Ethernet card 14 . 

One example of a software tool for use in a 
personal computer is an audio conferencing tool 
5 described in "vat - LBNL Audio Conferencing Tool", 
published May 1996 and available at http://www- 
nrg . ee . lbl .gov/vat . The packets generated by this tool 
conform to the Real-Time Transport Protocol (RTP) as 
described in "RTP: A Transport Protocol for Real-Time 
10 Applications", Network Working Group, January 1996, 

which is available from the Internet Engineering Task 
'3 Force (IETF) website as Request for Comment (RFC) 1889, 

http://www.ietf.org /rf c/rf cl889 . txt ?number=1889 . RTP 
provides end-to-end network transport functions 

sis 

p 15 suitable for applications transmitting real-time data, 
I'H such as audio, video, or simulation data, over 

multicast or unicast network services. 
= -™ Data packets sent on an RTP stream carry only a 

i= hint as to which call the data packets relate to. In 

2 0 RTP packets, the hint comes in the form of 32 bit 
H (i.e., 4 byte) identifiers which help locate the 

generator of the data in the packet. However, as the 
identifier is not centrally generated, cases arise 
where two sources generate the same identifier, and 
25 thus ^collide' on the use of a particular pattern in 

these 32 bits. In this case, the sources simply choose 
a new identifier, and the recipients are left to figure 
out what has happened. RTP itself does not contain a 
mechanism to keep the identifiers unique. 
30 Classical RTP requires that either all ports of 

the firewall be opened or that the firewall recognize 
that the incoming data is RTP data. Firewalls 
typically operate by rejecting (i.e., discarding) 
incoming traffic which is destined for ports (either 
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TCP or UDP) whose operational security is not known. 
Essentially, the firewall closes the ports for incoming 
traffic unless someone (e.g., the administrator of the 
firewall) can vouch that the destination port has a 
trusted program digesting the information. With RTP, 
the ports are 'unspecif ied' , so the administrator could 
end up with having to open up many, or all of the UDP 
ports to allow the voice traffic to come in. While a 
field in the IP header of a UDP packet explicitly 
states that the following payload data is UDP payload 
data, there is no further identification in the UDP 
packet that the UDP payload data is RTP data. RTP data 
travels inside a UDP datagram as a payload of UDP. 
Since there is no tag or identifier in the incoming 
data packet itself that identifies the incoming data as 
RTP data, the destination must be informed that 
incoming data is RTP data. If the receiving system does 
not know that data coming in on a particular port is 
RTP data, then the receiving system usually discards 
such data. There are a few heuristics which can be 
applied to the RTP header to take a guess as to whether 
it is RTP. Routers use these heuristics to compress 
RTP headers, but their compression techniques, when 
inadvertently applied, are not destructive. However, 
the routers are only looking for compressible data and 
not looking at the data in order to route the data or 
identify the data as belonging to any particular call's 
media stream. The routers merely seek to exploit the 
compressible aspects of an RTP header, if in fact, 
there is such a header on a particular packet. 

SUMMARY OF THE INVENTION 

A method of sending telephony traffic over 
Ethernet is disclosed. A fixed destination port for 
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telephony traffic is created at a destination. A fixed 
destination port for call control traffic is also 
created at the destination. A plurality of media 
streams are received at the telephony fixed destination 
5 port. The source of each media stream is commanded to 
provide a unique identifier for each media stream 
arriving at the destination from each source. Each 
media stream is identified by a unique identifier 
provided by the source. Transmission of packetized 
10 data to a fixed destination port for all audio traffic 
and all call control traffic allows the number of holes 
in the firewall of a system to be kept to a minimum. 

A method of limiting telephony traffic to a single 
JJjj hole in a firewall is also disclosed. A fixed 

O 15 destination port is created for telephony traffic at a 
I'fz destination. A first hole is opened in a firewall 

s corresponding to the fixed destination port for 

telephony. A fixed destination port is created for 
H call control traffic at the destination. A second hole 

'M 2 0 in the firewall is opened corresponding to the fixed 
H destination port for call control. A plurality of 

media streams are received at the telephony fixed 
destination port. .The source of each media stream is 
commanded to provide a unique identifier for each media 
25 stream arriving at the destination from each source. 

Each media stream is identified by a unique identifier 
provided by the source of each media stream. 

This invention will be more fully understood in 
light of the following detailed description taken 
3 0 together with the accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 illustrates a high-level block diagram of a 
packet -switching PBX of the prior art. 
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FIG. 2 illustrates, in a three-dimensional cut- 
away view, the cards included in a gateway of the prior 
art . 

FIG. 3 illustrates a telephony system in which 
5 media and call control streams must pass through a 
firewall . 

FIG. 4 illustrates media and control streams 
running between two locations. 

FIG. 5A illustrates an example of a call control 
10 mechanism operating between two locations. 

FIG. 5B illustrates an example of a call control 
O mechanism operating between two locations. 

FIG. 6 illustrates an example of the operation of 

^ a media stream server. 

ifl 

p 15 FIG. 7 illustrates an example of the command task. 

FIG. 8 illustrates an example of the receiver 

task . 

FIG. 9 illustrates an example of the sender task. 
FIG. 10 illustrates an example of the call 
20 controller. 

FIG. 11 illustrates that the receiver task going 
through the linked lists to determine find the remote 
node . 

FIG. 12 illustrates how the receiver and player 
25 tasks use the linked lists to send each media stream to 
an appropriate destination. 

Use of the same reference symbols in different 
figures indicates similar or identical items. 

3 0 DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

IP nodes (e.g., telephony switches) are connected 
by IP networks over a number of different systems, 
e.g., Ethernet 10/100/1G, Tl, DSL, Cable, PSTN, etc. 
Audio communication over packet -switched networks can 
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involve an audio stream passing through a firewall. 
Firewalls exist at the boundaries between a trusted 
internal internet and an untrusted external internet . 
Firewalls are controlled by a combination of software 
5 and hardware. A firewall is programmed to allow 

packets destined for known ports to enter those ports 
and to reject those packets with unknown destination 
ports. Firewalls protect against incoming traffic only 
and have filtering rules as to what packets are to be 
10 let through. Filtering rules are implemented by a 
system manager. Incoming traffic authorized by the 
filtering rules of the firewall to enter the 
destination computer system is said to pass through a 
52 "hole" in the firewall. Packets in the incoming 

O 15 traffic are pre-set at the source computer with the 
:/t target port number and/or IP address of the destination 

= computer. If the firewall guarding the destination 

I]? system does not recognize the incoming packets as 

authorized to enter the destination port number (i.e., 

=0 

^ 20 not authorized to pass through the "hole" in the 
\*± firewall) to which the packet has been directed by the 

source computer, the packet will not be allowed through 
(i.e., processed any further) and the packet will be 
dropped. 

2 5 One goal of the system administrator of a computer 

network is to limit unauthorized outside access to the 
computer system. The system administrator starts with 
a completely closed system (i.e., no outside traffic is 
allowed in) and opens up a number of ports to allow 

30 access for web traffic and Telnet access. TCP port 80 
is the port number commonly assigned by system 
administrators for incoming web traffic. Other port 
numbers may be used for File Transfer Protocol (FTP) 
and Telnet access. The system administrator obtains 
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these commonly assigned port numbers from industry- 
guidelines which suggest which port numbers should be 
used for a particular use. The Internet Assigned 
Numbers Authority (IANA) houses the many unique 
5 parameters and protocol values necessary for operation 
of the Internet and its future development. These 
parameters and protocol values may be found at 
http://www.iana.org/numbers.htm. Types of numbers 
range from unique port assignments to the registration 
10 of character sets. A copy of the list of port numbers 
may be found at http://www.isi.edu /in-notes/iana/ 
O assignments/port -numbers . It is from these lists that 

Cl port numbers were obtained for use in conjunction with 

u ' an embodiment of the invention, as seen below, 

p 15 In accordance with the invention, an embodiment of 

the invention includes using RTP protocol in a manner 
s that allows all incoming audio traffic to be directed 

j[f through a single "hole" in the firewall (i.e., directed 

\^ to one specific port) set aside for incoming audio 

]^ 20 traffic at a destination system. 

!« As data packets sent via UDP are not self- 

identifying in RTP, it is necessary for the source 
system to send a control message via TCP to the 
destination system that identifies the incoming media 

25 stream of data packets. The control message identifies 
the data packets by the IP address of the source system 
and the source port number from which the data packets 
originate. The call control message is sent via a call 
control mechanism that informs the destination (i.e., 

30 receiving) system which data packet corresponds to a 
particular call (i.e., media stream). 

The call control mechanism communicates 
information about the call (i.e., call identifier, 
source port number, source IP address, destination port 
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number, destination IP address) to both the source and 
destination systems. The communication of this call 
control information between systems is referred to as 
"call control" . Call control is transmitted between 
5 the source and destination systems through a second 

"hole" in the firewall (i.e., a second specific port in 
each system) set aside for all call control traffic. 
This second hole uses a destination port number set 
aside for TCP protocol . The destination port number 
10 for TCP is always a specific port number (e.g., port 
number 5002) chosen by the user for system-wide use. 
'3 The call control stream from the source (i.e., sender) 

to the destination (i.e., receiver) identifies the 

■l_ s source IP address, the source port number of the sender 

in 

p 15 and a call identifier (which is a token 32 bits) to the 

\ u a receiver. A control signal can be received either 

= before or after the data traffic starts. However, the 

H receiving IP telephony switch discards any data traffic 

5 

M received before the control signal is received as the 

J5f 2 0 switch is sensitive to both IP address and port number. 
f=^ RTP allows a new source port number to be used every 

time a telephone call is established. The source and 
destination media streams are identified by their IP 
addresses and UDP port numbers. The destination port 
25 number for UDP is always a specific port number (e.g., 
port number 5004) chosen by the user for system-wide 
use, but the source port number for UDP traffic varies 
for each call. This specific port number is chosen 
pursuant to the IANA guidelines mentioned above. Each 
30 media stream has its own unique pair of IP addresses 
and UDP source port number. The destination port 
number chosen by the user for the call control stream 
on TCP is port 5002 but any port number may be chosen 
by the user, pursuant to the IANA guidelines mentioned 



-10- 



M-8823 US 
635534 vl 



above. The actual call control mechanism used to 
transport the above -identified information is not a 
critical aspect of one embodiment; the only addition to 
a normal call control mechanism, in one embodiment, is 
5 transfer of information relating to the source port 
number of the sender. 

In accordance with one embodiment, a call control 
mechanism may be based on International 
Telecommunication Union (ITU) Recommendation H.323. 
10 The ITU is an international organization within which 
governments and the private sector coordinate global 
? 2 telecom networks and services. ITU Recommendation 

''•'4 H.323 (11/00) "Packet-Based Multimedia Communications 



Systems" discloses a conventional method of 
15 transmitting destination port information over call 

control. This method may be modified to transfer source 
port information of a sender, as described elsewhere 
herein. 

In accordance with one embodiment, the call 
2 0 control mechanism is based on Request for Comments 

(RFC) 2543: "Session Initiation Protocol" (SIP) (March 
1999) developed by the Internet Engineering Task Force 
(IETF) , an international community of network 
designers, operators, vendors, and researchers 
25 concerned with the evolution of the Internet 

architecture. SIP discloses an application-layer 
control (signaling) protocol for creating, modifying 
and terminating sessions with one or more participants. 
These sessions include Internet multimedia conferences, 
30 Internet telephone calls and multimedia distribution. 
Members in a session can communicate via multicast or 
via a mesh of unicast relations, or a combination of 
these. SIP invitations used to create sessions and 
carry session descriptions which allow participants to 
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agree on a set of compatible media types. SIP supports 
user mobility by proxying and redirecting requests to 
the user's current location. Users can register their 
current location. SIP is not tied to any particular 
conference control protocol. SIP is designed to be 
independent of the lower- layer transport protocol and 
can be extended with additional capabilities. RFC 2543 
is available at http : / /www . ietf . cnri . res ton . va . us/ rfc/ 
rf c2543 . txt ?number=2543 and discloses a conventional 
method of transmitting destination port information 
over call control . This method may also be modified to 
transfer source port information of a sender, as 
described herein. 

This system of media stream identification, based 
on pairs of IP addresses and UDP source port numbers, 
is not limited to audio traffic, but is also applicable 
to video stream data. 

In accordance with the invention, FIG. 3 
illustrates an example of a telephony system in which 
packets containing audio information must pass through 
a firewall . The system includes a plurality of source 
telephone instruments 30A, ... 301, ... 30N (A < I < N 
where N is an integer) , a plurality of source PBX 31A, 
... 311, ... 31N (where A < I < N) , a plurality of source 
firewalls 32A, ... 321, ... 32N (where A < I < N) , Internet 
33, destination firewall 34, destination PBX 35, and 
destination telephone instruments 36A, ... 361, ... 36N 
(where A < I < N) connected to PBX 35. The term 
"source" is used merely to illustrate the system from 
which a call was initiated and the term "destination" 
likewise merely refers to the system to which a call 
was directed. Each PBX 31A, ... 311, ... 31N, 35 has a 
configuration database 37A, ... 371, ... 37N (where A < I < 
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N) , 38 which contains a list of the port numbers of all 
extensions in the system. For example, configuration 
database 37A of source PBX 31A contains a list of port 
numbers for extensions of PBXs 31A, ... 311, ... 31N, 35. 
5 Likewise, configuration database 38- of the destination 
PBX 3 5 contains a list of port numbers for extensions 
of both PBXs 31A, ... 311, ... 31N, 35. As stated in more 
detail below, the configuration databases 3 7A, ... 371, ... 
37N, 38 and firewall settings may be adjusted at any 
10 time by a system administrator. For example, a port 

number could be set in a configuration database 37A, ... 
'3 371, ... 37N, 38 at the time the hole is inserted in the 

^dj firewall by the system administrator. Media stream 

-• S3. 

jtj Ethernet taps 39A, ... 391, ... 39N (where A < I < N) and 

O 15 call control stream Ethernet taps 96A, ... 961, ... 96N 
IU 

'--■-J (where A < I < N) are located between the Internet 33 

U and PBX 31A, ... 311, ... 31N. Ethernet taps 87, 88, 

H dedicated to media stream traffic and call control 

\ n stream traffic, respectively, are located between 

O 20 destination firewall 34 and PBX 35. There are handset 
taps 49A, ... 491, ... 49N (where A < I < N) located 
between telephone handsets 30A, ... 301, ... 30N and PBXs 
31A, ... 311, ... 31N. Handset taps 90A, ... 901, ... 90N 
(where A < I < N) are located on PBX 35. A line from 
25 each handset tap 90A, ... 901, ... 90N runs to a respective 
destination handset 36A, ... 361, ... 36N. PBXs 31A, ... 
311, ... 31N also have taps 40A, ... 401, ... 40N (where A < 
I < N) for analog links to the PSTN. PBX 35 also has a 
tap 89 for analog links to the PSTN. Media streams 
30 98A, ... 981, ... 98N (where A < I < N) and call control 

streams 99A, ... 991, ... 99N (where A < I < N) run from 
each PBX 31A, ... 311, ... 31N to the Internet 33. The 
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media streams 98A, ... 981, ... 98N pass through a 
different hole in destination firewall 34 than call 
control streams 99A, ... 991, ... 99N. However each media 
stream 98A, ... 981, ... 98N passes through a common hole 
5 in firewall 34 to enter Ethernet tap 87 and every call 
control stream 99A, ... 991, ... 99N passes through a 
common hole in firewall 34 to enter Ethernet tap 88. 
This system can be used with, but is not limited to, 
one-on-one calls, conference calls involving more than 
10 two phones, calls from a single source to multiple 
extensions at a destination, a call from a single 
^ source to individual extensions at multiple 

*%J destinations, and multiple independent telephone calls 

)t: (e.g., Call A from Source A is directed to extension 3 

O 15 at Destination C while Call B from Source B is directed 

111 

\i to extension 5 at Destination C) . 

5 For purposes of clarity, the following description 

relates to a call from one telephone instrument to 
H another telephone instrument. The initiator of a 

^ 20 telephone call picks up the handset of the first 
I s4 telephone 3 OA. Generation of a dial tone occurs in the 

telephony switch to which the handset is connected. 
The media stream is not established until the user 
Mials a number', e.g., the three-digit extension of a 
25 colleague in a remote office. Generators of the media 
stream always send the data to the same destination 
port number regardless of which PBX to the media stream 
is directed to. Each PBX is set to receive incoming 
traffic at the same destination port number every other 
30 PBX in the system is set to receive incoming traffic 

at. This UDP destination port number for the incoming 
RTP data is compiled into computer code (e.g., language 
C) and programmed into each PBX. In the alternative, 
the destination port number for UDP traffic could be 
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chosen at runtime and sent between telephony switches, 
i.e., the destination port number could be chosen 
during the runtime of the telephony switch, as opposed 
to the destination port number being assigned some 
5 value during the compilation of software which runs in 
the switch. Software compilation occurs at the factory 
of the switch manufacturer. The resulting software 
program is loaded into the telephony switch. After 
power is applied to the switch, the program runs, and 

10 eventually, assigns these port numbers. This 
destination port number is provided to the 
administrator of the firewall who then configures the 
firewall to allow incoming RTP data traffic through to 
the designated destination UDP port number. The 

15 ultimate destinations of the media streams may differ 
but the same destination port number on each PBX is 
used for incoming traffic. The initiator of the call 
then dials a desired number or extension and the 
telephone 36N at the destination rings regardless of 

20 where the destination telephone 3 6N and its associated 
PBX 35 are located. The telephone number dialed is the 
destination ID of the telephone 3 6N to which the call 
is directed (e.g., going to a particular destination 
PBX 35 and physical port number (e.g., port number 3) 

25 of the destination PBX 35) . The physical 'ports' on 
the PBX are also referred to as ^slots' (i.e., port 
number 3 is slot number 3) . The dialed number is sent 
to a call control mechanism located within the PBX 31A 
connected to the call initiator's telephone. The call 

3 0 control mechanism uses programmed instructions to 

determine the destination IP address from the dialed 
number by searching for the dialed number in the 
configuration database 37A of the PBX 31A. The search 
yields the IP address and a unique 32 -bit identifier of 
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the destination (i.e., dialed) telephone 3 6N. Each 
telephone extension in the system is associated with an 
IP address as well as a 32 -bit identifier unique to 
that extension. The call control mechanism sends 
5 messages over call control stream 99A to destination 
PBX 35 using TCP protocol. In order for media stream 
98A to be received at the destination telephone 36N, 
media stream 98A must pass through a firewall 34 
guarding the destination system. Although there might 
10 be a firewall 32A guarding the call initiator' s system, 
the firewall 32A guarding the call initiator' s system 

» 

~ ignores media stream 98A from the caller's PBX 31A to 

0 

*j the receiver's PBX 35 as the call is outgoing and not 

SJS, 

incoming. However, as RTP is unidirectional, the 
□ 15 receiver will generate its own media stream to 

communicate with the caller. This second media stream 
(not shown for clarity) must pass through the firewall 
J 32A guarding the call initiator's system. The receiver 

^ of the call (i.e., destination telephone 36N) is 

n 

S s 2 0 informed of the source port number for each individual 
^ call originating elsewhere by an external mechanism 

(i.e., a call control mechanism) pre-programmed into 
and distributed across all of the switches. Each PBX 
involved in a telephone call functions as part of the 
25 call control, as stated in more detail below. The 
individual PBX' s call control function depends oh 
whether the PBX is the PBX sending a call or the PBX 
receiving a call . The call control mechanism is 
programmed such that a source port number identifying 
3 0 the source of a call is generated by the source PBX 31A 
and then communicated to the destination PBX 35. This 
call control is done through a second hole in the 
destination firewall through which the TCP messages 
pass to the call control mechanism in the destination 
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PBX 35. The two PBX 31A, 35 communicate with each other 
and exchange messages through the call control holes in 
the firewalls 32A, 34. The destination PBX 35 then 
knows a call is coming and generates signals to the 
5 destination telephone 36N. It is the destination PBX 
35 that rings the attached telephone 3 6N. The media 
stream starts when the destination telephone 3 6N is 
answered. Each firewall needs two holes: one for TCP 
port number 5002 and one for UDP port number 5004. The 

10 distributed call control (i.e., the communication of 

call control between the PBXs involved in the call) is 
communicated to TCP port number 5002; consequently- 
passing through the hole in the firewall corresponding 
to TCP port number 5002. The media streams (all of 

15 them) are transmitted through the firewall hole for UDP 
port number 5004 . Call control communication is 
transmitted over TCP protocol to TCP port number 5002 
and the media stream RTP is transmitted over UDP 
protocol to UDP port number 5004. Essentially, the 

2 0 port number and IP address of the sending PBX 31A is 
communicated to the receiving PBX 35. The receiving 
PBX 3.5 uses this information to identify the call 
(i.e., the receiving PBX 35 can find all other data 
associated with the call based on this information 

25 alone) . 

The 32-bit call identifier is unique across all 
PBXs within a single customer environment. Every time 
a new call is initiated, a new call identifier is 
created. For example, the new call identifier may be 
30 created by incrementing the previous call identifier by 
one. The call identifier, used as a "tag", or " index" 
for the call, is unique among all other extant calls in 
the telephony system. Inside the system, there are 
several tables that contain information about pieces 



-17- 



M-8823 US 
635534 




and parts of each call. These tables usually contain 
the call identifier, so that any particular piece can 
be correlated with the call. The call may include 
several media streams, each stream with its own 
5 endpoint port number. But, wherever a data structure 
describes such a media stream, that data structure is 
indexed (i.e., tagged) with the call identifier. 

An endpoint exchange (i.e., the communication of 
the source port number to the destination PBX 35) 
10 occurs when the receiver of the call picks up the 

telephone 3 6N. The source port number is generated by 
^2 the initiator PBX 31A and retained in the initiator PBX 

''•■-4 switch 31A. The call control mechanism is used to 

]^ establish the two independent media streams, by 

O 15 symmetric exchange of IP address and UDP port number 
Vfs pairs. The result is two media streams; the 

5 initiator' s voice to the receiver' s earpiece and the 

|T receiver's voice to the initiator's earpiece. The 

I s * symmetric establishment of the two media streams take 

=0 

=*% 20 place concurrently, not sequentially, and occurs before 
H the receiving telephone is answered. Since the 

destination port number of each PBX 31A, 3 5 is always 
the same (e.g., TCP port 5002 and UDP port 5004), 
neither the initiator's firewall 32A nor the receiver's 
25 firewall 34 needs to be adjusted. 

In accordance with the invention, while the above 
call is in progress, a second telephone call may be 
placed from handset 301 to destination handset 36A and 
a third call from handset 30N to destination handset 
30 301. The second and third calls each obtain a unique 
source port number in the manner outlined above. The 
call control communication for each call, regardless of 
whether the source handset is 3 OA, 3 01 or 3 ON, is 
transmitted over TCP protocol to TCP port number 5002 
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of PBX 35 and the media stream RTP is transmitted over 
UDP protocol to UDP port number 5004 of PBX 35. 

In accordance with the invention, as seen in FIG. 
6 and discussed in more detail below, two important 
5 processes are running in each PBX: a receiver task 63 
and a sender task 64 . The receiver task 63 takes data 
packets off of UDP port 5004 and sends the packets to 
their appropriate destination based upon source port 
number and source IP address. The sender task 64 
10 generates the source port number so that each packet on 
a media stream may be uniquely identified for the 
O receiver task 63. Each PBX 31A, ... 311, ... 31N, 35 

performs the sender task 64 for each media stream 45, 
46 generated by PBX 31A, ... 311, ... 31N, 35 and each PBX 
15 31A, ... 311, ... 31N, 35 performs the receiver task 63 for 
each media stream 45, 46 received by PBX 31A, ... 311, ... 
31N, 35. 

There is a potential timing delay because data 
could arrive at firewall 34 of the destination system 
20 before the destination system is notified that data is 
coming. If that incoming data arrives before the call 
control mechanism is made aware of an incoming call, 
the incoming data is dropped by the destination system. 
The timing delay occurs with respect to the timing of 
25 (a) the source port being communicated to the receiving 
system and (b) the media stream flowing to that 
receiving system. For example, if the media stream 
packets for a new stream arrive at the receiving system 
before the distributed call control has communicated 
3 0 the new source port number, then the receiving PBX 3 5 
drops these packets as the receiving PBX 35 requires 
the source port number to identify which conversation 
the media stream belongs to. The receiving PBX 3 5 must 
be informed of the source parameters (i.e., port number 
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Sc IP address) as these parameters identify the call and 
where data for the call is originating. 

In accordance with the invention, a system can be 
run with one or more PBX . Two PBXs can be in the same 
5 building or in different buildings or even in different 
cities and the collection of these PBXs, and associated 
equipment, make up the phone system. The phone systems 
(PBXs) communicate with each other via transmission 
control protocol (TCP) internet protocol (IP) . TCP/IP 
10 and Universal Datagram Protocol (UDP) /IP are standard 
methods of sending data (e.g., voice), including 
*j| streaming media. In the layering approach to 

^-S networking protocols, TCP travels on top of IP. UDP 

ifl also travels on top of IP. Layering of network 

^ 15 protocols is common practice and well-known in the art. 

fy 

*d| Voice data is usually sent over UDP/IP, although it can 

also be sent over TCP/IP. However, the UDP/ IP domain 
is preferred as UDP/IP is better suited to the timely 
^ aspects of conversational audio. The salient timely 

p 20 aspect of conversational audio is: (1) if the data is 
1=5 late, the data is useless. TCP functions so as to 

guarantee the correct, ordered delivery of data (even 
in the face of network loss or temporary outages) 
though at the cost that the data may come slowly. In a 
25 voice system, the timely delivery of data is of high 
importance to maintaining a coherent conversation. A 
user can have a PBX at the home of the user, connected 
to an internet service provider (ISP) via Digital 
Subscriber Line (DSL) . Two or more PBX communicate 
30 using their Internet addresses. As seen in FIG. 3, PBX 
31A, ... 311, ... 31N communicate with another PBX 35 via 
IP over Ethernet. The PBXs 31A, ... 311, ... 31N, 35 
identify each other from their IP addresses. A PBX can 
be hooked to Tl lines, analog lines, etc. which are 
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controlled by the telephone companies. These "taps" 
3 9C to PSTN (publicly switched telephone network) 
central office are not the focus of the instant 
invention as there is no IP traffic. These "taps", or 
5 ports are software configurable, so that a handset port 
can act as a trunk link back to the central office. 

The configuration database 37A, ... 371, ... 3.7N, 38 
of each PBX 31A, ... 311, ... 31N, 35 contains the IP 
address of all the other PBXs that make up the PBX 
10 system for a particular business and where every 
telephone extension in the system is located. A 
"telephony management system" (TMS) (not shown) is 
''••-J implemented as a personal computer (PC) with a user 

i — 

jJJ management interface, including but not limited to a 

O 15 Graphical User Interface (GUI) , to inform the TMS about 

;~, any new PBX added to the system. The TMS also updates 

s the configuration database 37A, ... 371, ... 37N, 38 of 

Y2 each PBX as to the IP address and telephone extensions 

\^ associated with the new PBX. This information is 

jSj 2 0 entered into the system manually by the system 

1^ administrator or someone similar who uses the 

management interface in the telephony management system 
(TMS) to add the new PBX. The exact manner of how the 
PBX is connected to the Internet is not essential to 
25 the practice of this invention. 

In accordance with an embodiment of the invention, 
the following example is an illustration of the 
telephony system of FIG. 3. The system is comprised of 
an office in San Jose (SJ) and an office in Boston. 
30 Each office has a PBX 42, 41, respectively. Each PBX 

has the IP address of the other PBX and a list of local 
extensions in its configuration database. The 
configuration database essentially includes a table of 
telephone extensions and their corresponding IP 
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addresses. In accordance with an embodiment of the 
invention, each PBX in the telephony system is 
identical; made by the same manufacturer and programmed 
with the same instructions and data. In an alternative 
5 embodiment, in a telephony system with at least two 
PBX, at least one PBX in the telephony system is made 
by a different manufacturer from the at least one other 
PBX in the system but all PBX in the system are 
programmed with identical instructions and data. 
10 In accordance with the invention, as seen in FIG. 

4, two media streams and two control streams are 
^ transmitted between the Boston PBX 41 and SJ PBX 42 

*tj after a call has been established. The control stream 

~ 43 originating at the Boston PBX 41 is directed to the 

f3 15 TCP port 5002 of the SJ PBX 42 while the media stream 
Jf g 45 originating at the Boston PBX 41 is directed to UDP 

' port 5004 of the SJ PBX 42. Likewise, the control 

j]I stream 44 originating at the SJ PBX 42 is directed to 

if the TCP port 5002 of the Boston PBX 41 while the media 

==j 20 stream 46 originating at the SJ PBX 41 is directed to 
I s * UDP port 5004 of the Boston PBX 41. The control 

streams 43, 44 are used by all calls. There is one 
media stream per call from each telephone instrument 
but one control stream handles all calls (i.e., the 
25 control stream is not exclusive to any one particular 
call but used by all calls) . However, while the 
destination port numbers of each media stream will 
always remain constant , the source port number of each 
media stream will be unique for each call. It is the 
3 0 unique source port number that will allow the PBXs 41, 
42 to be able to distinguish one incoming data packet 
from another. A PBX receiving a plurality of data 
packets will review the source port number on each 
packet to sort the packets and group those with the 
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same source port number together. 

When a person in the Boston office wishes to call 
a person in the San Jose office, the Boston person 
picks up a telephone and dials a SJ extension (e.g., 
ext. 586). The Boston PBX 41 searches for ext. 586 in 
its extension table (i.e., configuration database). 
The configuration database lists extension 586 as being 
attached to slot or port number 3 of the SJ PBX 42. 
The Boston PBX 41 then transmits a call control message 
(via TCP/IP port 5002) to the SJ PBX 42 instructing the 
SJ PBX 42 to create a media stream. At this time, the 
Boston PBX 41 does not instruct the SJ PBX 42 to send 
the SJ-to-Boston media stream 46 to any particular port 
number on the Boston PBX 41. Since the Boston PBX 41 
has its own media stream 45 going from Boston to SJ, 
there are now two media streams; one from Boston to SJ 
45 and another from SJ to Boston 46. The control 
stream 43, 44 between the two PBX 41, 42 always exists 
as the control stream is used not only to communicate 
the source port for the media streams, but also for 
other functions not salient to the claimed invention. 
The TCP control stream is always directed to a 
particular destination port- (e.g., 5002) chosen by the 
manufacturer of the system. The use of port 5002 as 
the TCP port is in the source code. This constant 
destination port number may be changed to any other 
number as long as the change is made on a system-wide 
basis . 

When the extension in SJ is dialed, the Boston 
PBX 41 creates a Boston-to-SJ media stream 45. In the 
process of doing this, the SJ PBX 42 sends a message 
over the SJ-to-Boston control stream 44 requesting the 
operating system inside the Boston PBX 41 to transmit a 
unique port number (i.e., the source UDP port number 
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being used by the Boston-to-SJ media stream 45) to 
identify the incoming media stream 45 from the Boston 
PBX 41. The source UDP port number communicated to the 
SJ PBX 42 is one that is not already in use by another 
media stream. Meanwhile, the SJ PBX 42 receives a 
message, over the Boston-to-SJ control stream 43, from 
the Boston PBX 41 requesting the SJ PBX 42 create a 
media stream and a UDP port number to identify that 
media stream. The SJ PBX 42 sends a TCP/IP message to 
Boston with a SJ UDP port number for the SJ media 
stream 46. Thus, the UDP port number from which each 
media stream originates varies per call, regardless of 
whether the media stream originates from the Boston PBX 
41 or the SJ PBX 42. However, both the Boston PBX 41 
and the SJ PBX 42 use the identical UDP port number to 
receive incoming media streams . 

The SJ PBX 42 creates the SJ media stream 46 and 
unique port number which the SJ PBX 42 sends to the 
Boston PBX 41, via call control, identifying that a 
particular call originated at a particular port number. 
The Boston PBX 41 may request that the SJ PBX 42 
provide a source port number or the SJ PBX 42 may, as 
part of the call control of the SJ PBX 42, 
automatically do so when the SJ PBX 42 creates its 
media stream. Both mechanisms are used. Depending 
upon the exact ordering of the events in the call setup 
(or call handoff sequence, for the case of call 
transfer) , one or both of the mechanisms will effect 
the exchange. In the case of both, the excess work is 
discarded by the system. The Boston PBX 41 also has 
the programmed ability to request the UDP port number 
of an extant media stream in order to cover overlap in 
the asynchronous execution of these two symmetric 
algorithms. The destination port number for TCP is 
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always 5002. The designation of which destination port 
number is used for UDP and which is used for TCP all 
comes from software within the PBX. The destination 
port number is compiled into the code running in the 
5 PBXs . At this point, the Boston PBX 41 has a media 

stream 45 uniquely identified by the IP address of the 
Boston PBX 41 and the UDP source port number of the 
Boston PBX 41 obtained when the media stream 4 5 was 
created. In SJ, the SJ media stream 46 is created and 
10 uniquely identified by the SJ IP address and SJ source 
port number obtained. 
O Each system must transfer the UDP source port 

^ number for a particular media stream from its PBX back 

jj? to the PBX at the destination of the media stream. The 

15 UDP source port numbers are not transferred inside the 
f^J media stream. The data inside the media stream must be 

= uniquely identified so that the data is routed to the 

=3. 

™ appropriate destination. The Boston PBX 41 sends the 

f=£ port number of its media stream to the SJ PBX 4 2 in the 

2 0 control stream in a different message (i.e., an 
S=i "endpoint" information message) from the first message 

sent to command the SJ PBX 42 to create the media 
stream. The SJ PBX 42 does likewise. It is the unique 
identification of each media stream that allows the 

2 5 receiver task in a destination PBX to sort through a 

plurality of media streams from various sources and 
send each media stream to its intended destination. 
The key is to communicate the UDP port number from the 
source of the media stream to the destination of the 

3 0 media stream. Every media packet being sent from the 

SJ PBX 42 to the Boston PBX 41 has a destination and 
source IP addresses as well as destination and source 
port numbers. Meanwhile, the Boston PBX 41 is doing 
the same thing (i.e., sending packets with destination 
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and source IP addresses as well as destination and 
source port numbers) . 

Essentially, the Boston PBX 41 sends the UDP port 
number of its media stream to SJ PBX 42 and the SJ 
5 PBX 42 does likewise. Both Boston and SJ send endpoint 
information messages to each other, including an 
identifier of the call and, most importantly, the UDP 
port number of the media stream associated with the 
call. This endpoint information informs each PBX that 
10 voice communication can start. This is the key to 

communicating the UDP port number from the source of 

Z the media stream to a recipient destination. 

y 

'-4 Port 5002 is the port number chosen by the maker 

^ of the PBX for the TCP call control hole in the 

3 15 firewall. Any port number could have been chosen but 

y 

3 once chosen, the port number is fixed across the 

private branch exchange system. Any change in port 
[I number must be made on a system-wide basis. The TCP/IP 

^ messages are sent to the TCP port 5002, which like the 

y 

□ 20 UDP port number, is also chosen arbitrarily. 

™ Port 5004 is recommended by the RTP RFC standard 

for UDP transmission, RFF 1890, RTP Profile for Audio 
and Video Conferences with Minimal Control, January 
1996, http://www. ietf.org /rf c/rf cl890 . txt . However, 

2 5 as stated above, the chosen number does not matter as 

long as the number is fixed across the private branch 
exchange system. The User is usually not authorized to 
change this number. The firewall administrator is 
usually the only one authorized to open a hole at this 

3 0 chosen port number. 

While the data payload of each packet does not 
contain any identifying information, the header of each 
packet does contain IP address and port number 
information. A typical voice packet in the media 
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stream on a Local Area Network (LAN) includes the 
following: Ethernet header (14 bytes) , IP header (20 
bytes, including the destination IP address and source 
IP address) , UDP header (8 bytes, including destination 
5 and source port numbers) , RTP header (12 bytes) , and 
payload (10 to 160 bytes) . 

The two source port numbers, one for each half of 
a conversational media stream, are generated 
asynchronously by the two systems. While the 
10 destination port number varied per each call in prior 
systems, the instant system uses a fixed destination 
port which allows the system administrator to keep the 
H y holes in the firewall to a minimum. The uniquely 

™ generated source port number of each media stream is 

!□ 15 what enables the PBX to identify and distinguish which 

iy 

packet belongs to a particular call. 
5 In accordance with the invention, the actual 

§T mechanism for transmission of the uniquely identifying 

H source port number is through the call control 

1^ 20 mechanism. FIG. 5A illustrates an example of a call 
H control mechanism that operates between at least two 

PBX 50, 54 wherein PBX 50 represents call source PBX 
and PBX 54 represents the call destination PBX. 
However, each PBX in a system is capable of functioning 
2 5 as both a source and a destination PBX. In accordance 
with the invention, as seen in FIG. 5B, a call 
initiated from PBX 54 to a destination at PBX 50 
contains identical functions to that in FIG. 5A except 
that PBX 54 is now the call source PBX while PBX 50 is 
30 the call destination PBX. The ultimate goal of the 
call control mechanism is to exchange source port 
numbers between PBX 50, 54. Three main functions are 
required to accomplish the exchange of source port 
numbers. Each function is represented by a bubble 
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(FIGS. 5A and 5B) and each function operates 
independently of the others. The three main functions 
are multi switch control (ms_ctrl) 51, media stream 
server (ms_svr) 52 , and the call controller 53. The 
5 media stream server 52 and call controller 53 are part 
of the source PBX 50 while the multi switch control 51 
is actually part of the call controller of the 
destination PBX 54. 

Multi switch control 51 is the replication agent 

10 for any existing call from one PBX over to another PBX. 
As stated above, both PBXs run the same functional 
blocks of code, i.e., both the ^calling' PBX and the 
^called' PBX contain ms_ctrl 51, ms_svr 52 and call 
controller 53 (which contains the calling PBX' s 

15 ms_ctrl) . The replication agent is the ms_ctrl within 
the call controller 53 which allows the two (or more) 
PBX to act as if they were a single PBX. 

The media stream server 52 sends and receives RTP 
data (i.e., payload) . The media stream server 52 

2 0 gathers data from the telephone instrument and sends 

that gathered data as RTP data. The media stream 
server 52 also performs the opposite function by taking 
RTP data from the network and converting that data into 
a form compatible with the telephone instrument. 
25 The media stream server (ms_svr) 52 includes five 

tasks: (1) a command task that communicates with the 
call controller 53 to determine the media streams that 
collectively define a single call; (2) a receiver task 
that takes data from the network; (3) a sender task 

3 0 that sends data to the network; (4) a player task that 

works with the receiver task to manage the incoming 
data jitter buffers; and (5) a proxy task that forwards 
the media to voice mail in a format compatible with 
voice mail . 
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As the media stream server 52 and multi switch 
control 51 do not communicate directly with each other, 
the call controller 53 is an intermediary between the 
two functions. The call controller 53 is necessary to 
communicate the source port numbers between media 
stream server 52 and multi switch control 51. 
Communication between the call controller 53 and media 
stream control is focused on call replication 57. Call 
replication is a generic term applied to all 
communication between the media stream server 52 and 
the multi switch control 51 regarding the set-up and 
identification of each media stream. 

As discussed in more detail below, several 
messages run between the media stream server 52 and the 
call controller 53. These messages include "mscreate" 
73, "udp_port" 55, w add_node_req" 5 6, and "set 
endpoint" 76. 

In accordance with the invention, as seen in FIG. 
6, linked lists 61 connect the command task 62, 
receiver task 63, sender task 64, and player task 65. 
The linked lists are an in-memory data structure, which 
is shared by all the running tasks enumerated above. 
Network data 66 enters the media stream server 52 and 
is sent to the receiver task 63. The receiver task 63, 
as will be explained below, sends the data to a 
particular incoming jitter buffer 67, depending on the 
source port number and source IP address of the data. 
A representative jitter buffer is fully disclosed in 
co-pending U. S. patent application Serial No. 
09/314,593 (Attorney Docket No. M-7547 US), entitled 
"Increasing Duration Of Information In A Packet To 
Reduce Processing Requirements", which is herein 
incorporated by reference for all purposes. Each 
incoming jitter buffer 67 is set up for packets from a 
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particular source port number and source IP address. 
The data is then passed to the player task 65 which 
works in conjunction with receiver task 63 to manage 
the buffered data and the incoming data jitter buffers 
5 67. The player task 65 converts incoming network data 
to an analog signal at the speaker 68 of the dialed 
telephone instrument (i.e., correct destination). 

Outgoing data originates from a microphone 6 9A of 
the telephone and is run through a codec 69B. Data 

10 from the codec 69B is then passed to the Digital Signal 
Processor (DSP) 69C. The DSP 69C takes individual 
samples of data from the network buffers, and meters 
them out to Digital -to-Analog Converters (DACs) located 
on the. line card in the telephony switch in an exact 

15 time sequence. The DSP 69C then sends the data to the 
sender task 64 . The sender task 64 breaks up the data 
and places it into a particular network buffer 60 which 
then sends the data out of media stream server (ms_svr) 

52. The data is sent as RTP packets, encapsulated with 
2 0 UDP, IP, and Ethernet headers. The command task 62 is 

in communication with the linked lists 61 and with the 
outside system (not shown) . 

The command task 62 sets up the UDP source port 
number, manages the communication channels running 
2 5 between the command task and call control 53 as well as 
the asynchronous communication with the call control 53 
in the PBX. The command task 62 opens two 
communications channels, or pipes, to call controller 

53. The two pipes, collectively known as x the command 
30 pipe', are the request pipe ("pipe_ms_ctl_cmds" ) and 

the response pipe {" pipe_ms_ctl_resps" ) . The request 
pipe is used to send requests from the call controller 
53 to the command task 62 and the response pipe is used 
to send responses back to the call controller 53 from 
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the command task 62 . The command task 62 reads off of 
the command pipe. Each task (sender, receiver, 
command, proxy) ends up as an infinite loop. 

In accordance with the invention, as seen in FIG. 
5 7, the command task 62 first opens pipes 70 to the call 
controller 53 from the media stream server 52. The 
command task is started at system initialization. The 
command task 62 then waits for one of four possible 
commands from the command pipe 71 (i.e., from the 

10 request pipe) : "media stream create" 73, "add a node" 
74, "add a slot" 75, and "set endpoint" 76. A switch 
statement coming from the command pipe directs which 
command is to be executed 72. 

The first possible command is media stream create 

15 ("mscreate") 73. The end result of this command is the 
creation of a media stream. This command may be 
generated by the call controller 53 in response to 
local or remote telephone being used. In summary, the 
command task 62 calls to "set up RTP header" and 

2 0 generates a UDP source port number which is saved in 
the linked list 61 data structure (See 73A) . An 
outgoing packet template is prepared with a header for 
later use by "sendbuf" 73B. This template is used over 
and over every time a packet is sent as the UDP port 

25 number has been inserted into the packet template 73B. 
After receiving a message "udp_port" 55, mscreate 73 
will then send the UDP source port number back to call 
control 53 for transmission back to remote node 73C. 
The command task then waits for a command from the 

30 command pipe 71. This command task initiates the media 
stream for the call. After that, the next thing to be 
done is to add the extension for the call. The next 
thing the call controller 53 does is "add_slot" which 
adds a slot . 
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The second possible command is to add a node ("add 
node") 74. In summary, "add node" adds a node to the 
system even though it is not aware of which port this 
information is for, informs the system a node was 
5 added, and puts the port information into the media 
stream node (msnode) , searches for the media stream 
identifier (i.e., the unique number of the media 
stream) , finds the media stream node with that 
particular IP address, and sends a reply (to the 
10 request to add a node) to the call controller 53 saying 
that the task of adding a node to the data structure 61 

'jf has been done. This command augments the data 

=y 

-4 structure, or builds the data structure 61, which is 

Y-l shared by all the media stream tasks. This data 
O 15 structure is not strictly for use by the Receiver task 

^ as the sender uses it also for outgoing voice data. It 

s must be noted that the source port number is unknown to 

ijT the destination PBX at this time. A message 

|« " add_node_req" 56, requesting that a node be added, is 

bp 

20 sent to the call controller 53 which is informed when 
M "add_node" is done 77. The command task then waits for 

a command from the command pipe 71. 

The third possible command is to add a slot 75. In 
summary, the "add slot" command is used whereby the 

25 slot is added as a local extension to the media stream 
(i.e., a structure representing the local port is 
linked on to the local media stream) , and then informs 
call control that it is done. 

This command sends a message, " add_slot_req" , from 

3 0 the call controller 53 to the command task. The 

command adds a data structure (i.e., slot) representing 
the local port to the media stream for use by the 
Sender and Player tasks 7 5A. A message is sent 
informing call controller that "add_slot" is done 77. 



-32- 



M-8823 US 
635534 vl 



The command task then waits for a command from the 
command pipe 71 . 

The fourth possible command is to ' x set_endpoint" 

76. This command searches for a media stream based on 
5 a unique number that is the media stream identifier 

(e.g. call identifier) 76A. The command finds the 
remote node ("ms_node" ) having the same IP address and 
no port number 76B. The command then inserts the port 
number into the ms_node list 7 6C. A message is sent 
10 informing call controller that w set_endpoint" is done 

77. The command task then waits for a command from the 
command pipe 71. 

In accordance with the invention, FIG. 8 
illustrates receiver task 63. Receiver task 63 takes 

15 data off the network. The only job receiver task 63 

has is to take packets off UDP 5004 (i.e., the incoming 
source media stream) and send the packets on their way 
to an appropriate incoming data buffer 67 based on the 
source IP and source port addresses of the data. A 

20 "table" in the PBX contains the source IP and source 
port numbers that are matched to local telephone 
extensions. The receiver task 63 listens on port 5004 
for incoming packets, takes a packet in, determines the 
source of the packet based on source IP address and UDP 

25 port number, and then identifies the media stream nodes 
(msnodes) . 

In accordance with the invention, the media stream 
node linked list 61 is used by the receiver task 63 to 
find a particular remote node, as seen in FIG. 11. 
3 0 Once the receiver task 63 matches the source port 

number and IP with the remote nodes, the receiver task 
63 hands over the information to the player task 65 at 
the end of the remote nodes, as seen in FIG. 12. The 
receiver task 63 is involved with the incoming jitter 
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buffers 67 . The ms_node linked list 61 is what binds 
all of these tasks together. 

The receiver task uses the linked list of remote 
nodes and walks through this list to find the 
particular remote node. The receiver task transfers 
the information over to the player task which then 
searches for the media stream which has a pointer to 
the extension list, as seen in FIG. 12. The receiver 
task takes data off UDP port 5004 and determines which 
call (i.e. media stream) each packet belongs to using 
source IP and source port address to identify the 
appropriate media stream and then sends the data to the 
appropriate extension (s) . It then finds the remote 
node ("msnode" in the source code) based on the source 
port number. The task then extracts the sender's UDP 
port number. It then finds the media stream from the 
remote node and converts the data from an encoded 
format. A field in the RTP header designates the 
encoding format. The conversion applied by the 
receiver task is from the encoded format to a linear 
format . 

In accordance with the invention, as seen in FIG. 
8, the receiver task 63 receives data (i.e., packets) 
from the network 81. The receiver task 63 extracts the 
UDP port number of the sender 82. The receiver task 63 
then determines 83 if the UDP port number and source IP 
are found in the ms_node linked list 61. If not 84, 
the data is discarded and the receiver task 63 waits to 
receive more data from the network. If the UDP port 
number and source IP are found in the linked list 61, 
then the data is stored in the incoming jitter buffer 
67 related to the media stream identified by source 
port number and source IP address 85. The receiver 
task 63 waits to receive more data from the network 81. 
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The receiver task 63 accesses the linked list of remote 
nodes (ms-node list) 61 and walks through this list 61 
to find the particular remote node and searches for the 
media stream which has a pointer to the extension list. 
5 The sender task 64 gets data from the handset 

microphones 69A (a CODEC 69B is used to convert the 
data to a usable format) . The sender task 64 takes 
incoming data off local extensions (DSP data 
essentially) . The sender task runs every 10 
10 milliseconds by taking in coming DSP data off the local 
extensions and building the outgoing buffers 60. When 
a: !f a buffer 60 fills up, the sender task 64 sends out the 

■%4 full buffer of payload by using the command, "sendbuf". 

jfj; After building up data from DSP 69C, "sendbuf" searches 

O 15 through the linked list 61 to determine if any media 
;jt streams are related to the full buffer of payload in 

e order to send data out to each remote node that is 

l~[ attached to the media stream whose buffer is full . The 

!** command "sendbuf" attaches the RTP header. The UDP 

•* ^ 

m 2 0 port number is created when the media stream is 
!= created. The UDP header is created and includes the 

UDP port number previously created. UDP source port 
number is filled into buffer. This UDP port number is 
associated with the media stream moving data from the 
25 source telephony switch (e.g., the microphone on a 
handset) to another telephony switch (e.g., to be 
played out of the speaker on another handset) . A query 
is needed to get the number out of the operating 
system. 

30 During "ms_create" in the command task, the 

operating system is instructed to assign a port number 
but the operating system keeps the number to itself. 
The port number assigned from the operating system is 
queried. The port number assigned is saved in the 
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linked data structures, and are used by the sender 
task. 

In accordance with the invention, as seen in FIG. 
9, the sender task 64 waits for the next DSP tick 91. 
5 The sender task 64 then traverses the ("mscblist" ) list 
of all media streams and, for each media stream, 
traverses all local extension numbers (i.e., slots) and 
merges the DSP buffers for the same destination (i.e., 
source bridging of conference data) 92 . The sender 
10 task then traverses ("mscblist" ) the list of all media 
streams to put the data into the network buffers and if 
'3 any network buffer is full, sender task sends to all 

remote nodes of the particular media stream that has a 
" s full buffer 93. Sender task then resumes waiting for 

□ 15 the next DSP tick 91. 

- s - 

l~ As seen in FIG. 6, the player task 65 works in 

5 conjunction with the receiver task 63 to manage the 

1*2 buffered incoming data and incoming jitter buffers. 

S« The player task 65 takes the buffered data and plays 

™ 2 0 the data into the headphone of the telephone 
j=* instrument . 

The proxy task ends up going through "send_buf", 
as discussed above. The proxy task forwards the media 
to voicemail in a form voicemail likes (i.e., proxy 

2 5 task changes the format) . Proxy task sends data to the 

network and stands between voicemail and an extension. 
Proxy task is sending data from one PBX to another PBX; 
10 millisecond packets are changed into 125 millisecond 
packets linearly encoded . All media packets to and 

3 0 from the network uses the constant destination port 

number and varying source port number. 

The call controller 53 encompasses everything 
other than the media stream server 52 and the multi 
switch control 51. In summary, the call controller 53 
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interacts with the command process. The command 
"create call" creates a unique media stream identifier 
used to identify a media stream at the media stream 
layer and then adds parties. The call controller 53 
5 uses the media stream identifier to create a unique 
media stream, gets UDP port number and stores it. If 
remote parties are involved in the call, then the 
remote parties are informed of the unique source port 
number via TCP message through the msctrl mechanism. 
10 However, this message is not sent until call controller 
53 knows there are remote parties. 

3 In accordance with the invention, as seen in FIG. 

™e 10, the call controller 53 waits for a call setup event 

H (local or remote) or an endpoint event 110. If there 

n 

=§ 15 is a setup event, call controller 53 creates a call. 

U This includes creating a unique media stream identifier 

s 

to identify the media stream 120. The call controller 
3 53 then sends a "create media stream" message to the 

^ command process in "ms_svr" module 52 and passes the 

2 0 media stream identifier 13 0 to the command process. 
^ This is step (1) . Once this is done, the call 

controller 53 stores the resulting UDP port number 140 
in a data structure of its own. The call controller 53 
then sends an "add slot" message to the command process 
25 150. This is step (2). The call controller 53 then 
collects digits of the destination phone number 160. 
The call controller 53 then searches a table (i.e., 
configuration database) to find the IP address of the 
PBX switch 54 that handles the destination phone number 
30 170. The call controller 53 then determines if the 
dialed party is at a remote node 180. If the dialed 
party is at a remote node, then call controller 53 
sends an "add node" message to the command task 181 
(this is step (3) ) . This "add node" message adds the 
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remote node to media stream linked list 181. The call 
controller 53 reviews all messages between msctrl 51 
and replicates the call on the remote node by repeating 
steps (1), (2), (3) & (4) 182. Once that is done, call 
5 controller 53 sends a TCP/IP message to the destination 
PBX 54 through the "msctrl" mechanism 51 to inform the 
remote call controller (i.e., destination PBX 54) of 
the UDP source port number 183. The call controller 53 
then waits once again for a call setup event 110. 
10 If the dialed party is not at a remote node, the 

call controller 53 sends an "add slot" message to the 
command process 184 . The call controller 53 then once 
again waits for a call setup event or endpoint event 
110. 

15 If there is an endpoint event, call controller 53 

sends a "set endpoint" message to the command process 
190. The call controller 53 again waits for a call 
setup event or endpoint event 110. 

There are alternative ways to identify the source 

2 0 of each call other than by source port number. As 

stated above, the RTP header in each media stream 
packet is 12 bytes in size. The RTP header is 
comprised of 4 bytes of time stamp, a 2 byte sequence 
number, 4 bytes for synchronous source identifier 
25 (SSRC) , and 2 bytes for RTP header extensions. 

The SSRC can be used to uniquely identify the 
source of media stream being sent. However, the SSRC 
can be changed at any point by the media stream 
generator. Systems could use the SSRC to identify the 

3 0 media stream source but the method to do so is more 

complex than identifying by source port number. In 
order to use the SSRC, the media generator requires a 
control channel to communicate the changes in SSRC in a 
time-sensitive, synchronized fashion in order to avoid 
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loss of data. The source could change the SSRC in the 
middle of a call. However, if someone else uses the 
same identifier, the media streams of each user using 
the same identifier collide and both sources have to 
5 use new SSRC numbers. Thus, the SSRC is one way to use 
the RTP header to identify media streams. 

The header extension could be used to identify the 
source of the media streams but this method is less 
efficient since the network data representing the 
10 stream will be larger, consuming more network 

bandwidth. This method may also be less efficient when 
deployed with available routing equipment, which, upon 
recognition of such a header extension, may cease to 

^ compress packets traveling over a slow network link. 

in 

q 15 The 2 bytes in the RTP header for extensions can 

= describe the encodings for payload type and the 

= existence of merged conference streams (i.e. conference 

calling) . These 2 bytes can also indicate the 
i=& existence of extensions (i.e., whether the header has 

2 0 been extended) . These two bytes can be made to stand 

SB 5 

for whatever the user wants the two bytes to indicate, 
including, but not limited to, the source of the RTP 
data . 

The above-described embodiments of the present 
25 invention are merely meant to be illustrative and not 
limiting. It will thus be obvious to those skilled in 
the art that various changes and modifications may be 
made without departing from this invention in its 
broader aspects. Therefore, the appended claims 
30 encompass all such changes and modifications as falling 
within the true spirit and scope of this invention. 
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